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Related Application 

[0001] This application hereby claims priority under 35 U.S.C. §1 19 to a 
Provisional Patent Application entitled, "The Design of MVM — a Multitasking 
Virtual Machine," filed March 15, 2001 by inventors Grzegorz J. Czajkowski and 
Laurent P. Daynes (Application No. 60/276,409). 

[0002] The subject matter of this application is related to the subject 
matter in a co-pending non-provisional application by the same inventors as the 
instant application entitled, "Method and Apparatus For Class Initialization 
Barriers in Multitasking Virtual Machines," having serial number 09/975,145, and 
filing date October 9, 2001. 
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BACKGROUND 



Field of the Invention 

[0003] The present invention relates to computer instruction code. More 
specifically, the present invention relates to a method and an apparatus that 
facilitates sharing computer instruction code between programs in a multitasking 
virtual machine. 

Related Art 

[0004] Computer programs written in languages such as JAVA™ are 
compiled into a platform-independent code, which is executed on a virtual 
machine, such as the JAVA VIRTUAL MACHINE (JVM). A program that has 
been compiled into a platform-independent code has the advantage that it can 
execute on a virtual machine regardless of the underlying central processing unit 
and native code. The terms JAVA, JVM and JAVA VIRTUAL MACHINE are 
trademarks of SUN Microsystems, Inc. of Palo Alto, California. 

[0005] A virtual machine typically includes an interpreter, which 
interprets the platform-independent code into native code to perform the desired 
operations. Interpreting the platform-independent code is an inherently slow 
operation; therefore, many virtual machines also include a dynamic compiler, 
which can compile the platform-independent code at runtime into the native code 
of the machine being used to host the virtual machine. Compiling the platform- 
independent code into the native code of the host machine can reduce the 
execution time of the program. 

[0006] Virtual machines for object-oriented programming languages with 
dynamic class loading typically load the code of a class when a program resolves a 
symbolic reference to that class for the first time. The class needs to be initialized 
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subsequently when the program uses it for the first time. Loading and 
initialization of a class are two separate events. Initialization of a class may never 
take place even though the class has been loaded. In the case of the Java 
programming language, the initialization of a class consists of executing some 
5 code, known as the class's static initializer, that brings the class's variables (also 
known as the static variables) to a well-defined initialized state. A virtual 
machine implementation may choose to set a class to the initialized state upon 
loading when no action is required to initialize that class. For instance, in the Java 
programming language, no action is required to initialize a class when this class 
10 has no declared static initialization sequence, and either no non-final static 
f* variables, or non-final static variables that are all declared to be set to a default 

13 value. In this case, a virtual machine implementation can benefit from setting 
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such initialization-less classes to the initialized state upon class loading. 

[0007] A class initialization barrier is a sequence of native instructions 
| * 1 5 that calls the virtual machine's runtime to initialize a class if it is not already 

g«t initialized. Class initialization barriers are included in the implementation of 

: * those platform-independent instructions that may result in the very first use of a 

class (in the case of the Java programming language, there are 4 such instructions: 
getstatic, putstatic, invokestatic, new). The implementation of a platform- 
20 independent instruction can come in two flavors: (i) as a sequence of instructions 
that is part of the implementation of an interpreter of platform-independent 
instructions, (ii) or as a sequence of instruction generated by a dynamic compiler 
of platform-independent instructions. 

[0008] Because class initialization barriers need only to be executed once 
25 per class, it is common practice in the implementation of non-multitasking virtual 
machines to have recourse to code-rewriting techniques to remove the class 
initialization barriers and the overhead they induce. In other words, a class 
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initialization barrier can simply be an instruction that calls the virtual machine 
runtime and that is removed, or replaced, by the runtime before returning to the 
call site. 

[0009] A multitasking virtual machine (MVM) aims at saving processing 
5 and memory by sharing as much of the runtime representation of a class as 
possible among tasks executing programs that use the class. Targets for sharing 
include the meta-data describing the class, the platform-independent code of the 
class's methods, and the native code produced by the dynamic compiler. Code re- 
writing techniques can be harmful when code, whether it is platform-independent 
10 or produced by a runtime compiler, is shared between multiple tasks. For 

I* instance, rewriting native code to remove a class initialization barrier is incorrect 

p 

when that code is shared between multiple tasks, since tasks that have not 
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initialized the corresponding class may subsequently be given access to that class 
'Q without the class being initialized first. Task re-entrant class initialization barriers 

I * 1 5 that test the initialization state of a class for the current task must be used instead. 

I«* However, these barriers may result in severe performance penalty, in particular, by 

preventing some useful optimization, such as method inlining. 
13 [0010] What is needed is a method and an apparatus that provides the 

iy benefits of sharing dynamically compiled native code among tasks of a 

20 multitasking virtual machine while eliminating the overhead resulting from the 
use of task re-entrant class initialization barriers. 

SUMMARY 

[0011] One embodiment of the present invention provides a method that 
25 enables native code, dynamically compiled from platform-independent code by 
one task of a multitasking virtual machine, to be used by other, potentially 
concurrent, tasks. Such sharing of dynamically compiled code reduces both the 
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amount of dynamic compilation performed by, and the memory footprint of each 
task. The dynamic compiler can produce two types of native code that can be 
shared among tasks: task re-entrant code, and class initialization dependent (CID) 
code. Task re-entrant code is guaranteed to be executable by any task, irrespective 
5 of the classes already initialized by the task. In order to achieve this, the dynamic 
compiler generates task re-entrant class initialization barriers anywhere in the 
code where a class might be used for the first time by a task (in the case of the 
Java programming language, a task re-entrant class initialization barrier must be 
generated before any sequence of instructions generated to implement any of the 
10 four bytecode instructions getstatic, putstatic, invokestatic and new.) When 
I* producing CID code, the compiler assumes that a task will execute this code only 

after the task has initialized all the classes from an initialized set associated with 
the code. This allows the compiler to avoid generating task re-entrant class 
initialization barriers for classes from this initialized set, which in turn may foster 



In) 

1 4 15 further optimizations, such as, in the case of the Java programming language. 
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Pi inlining of static methods, or inlining of methods that include access to static 

variables of a class. The runtime of the system must guarantee that a task uses 
CID code only when all the classes of the initialized set for this code have been 
initialized by the task. 
20 [0012] In one embodiment of the present invention, the system maintains 

only one version of dynamically compiled native code per method of a class, 
which may be either task re-entrant or class initialization dependent. Upon 
invocation of a class method without native code, the method's platform- 
independent code is interpreted, irrespective of the task invoking the method. 
25 After a threshold number of interpreted invocations, the dynamic compiler is 
invoked to build CID native code for the method. Upon invocation of a class 
method set with CID code, the native code is executed only for those tasks that 
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have already initialized all the classes specified in the initialized set of the native 
code; for all other tasks, the method's platform-independent code is interpreted. If 
a task fails to initialize all the classes of the initialized set of the CID code of a 
method after a threshold of the number of invocations of that method, a task re- 
5 entrant version of the native code of the method is produced to replace its CID 
code. Upon invocation of a class method set with task re-entrant native code, the 
native code is executed, irrespective of the task invoking the method. 

[0013] In one embodiment of the present invention, the system maintains 
two versions of dynamically compiled native code per method of a class, one that 
10 is task re-entrant, and the other that is class initialization dependent. Upon 
M» invocation of a method without native code, the method's platform-independent 
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Q code is interpreted, irrespective of the task invoking the method. After a threshold 

IS* 

J f '| number of interpreted invocations, the dynamic compiler is invoked to build CID 
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native code for the method. Upon invocation of a class method set with CID 

l J 

f * 1 5 code, the native code is executed only for those tasks that have already initialized 

!<•) all the classes specified in the initialized set of native code; for all other tasks, if 

f * s the method also includes task re-entrant native code, this one is executed. 

I* 

Otherwise, the method's platform-independent code is interpreted. After a 
threshold of the number of invocations of a method without task re-entrant native 
20 code but with CID code, if a task fails to initialize all the classes of the initialized 
set of the CID code, task re-entrant native code is produced for the method and 
added to it. The CID native code of a method that also has task re-entrant native 
code is replaced with another CID produced with a different initialized set after a 
threshold number of invocations to the method that did not result in executing the 
25 CID native code. After a threshold number of such replacement of CID code, the 
method is set with task re-entrant native code only. 



6 

Attorney Docket No. SUN-P6 1 1 9-RSH Inventors: Daynes et al, 

EJG EASUN MICROS YSTEMS\SUN-P6U9-RSH\SUN-P6 J 1 9-RSH APPLICATION DOC 



[0014] In one embodiment of the present invention, the system maintains 
more than one version of dynamically compiled native code per method of a class, 
wherein at least one version is task re-entrant, and the remaining versions differ by 
their initialized set and the optimizations they enabled. 
5 [0015] In one embodiment of the present invention, the initialized set used 

to build CID native code for a method is the intersection of the set of classes 
already initialized by the task that triggered the compilation with the set of classes 
whose initialization may be triggered by the method to be compiled. 

[0016] In one embodiment of the present invention, the initialized set used 
10 to build CID native code for a method is the set of all the classes whose 
I* initialization may be triggered by the method to be compiled. 

[0017] In one embodiment of the present invention, the initialized set used 
to build CID native code for a method is the intersection of each of the set of 
classes already initialized by all the running tasks and of the set of classes whose 
1 « 15 initialization may be triggered by the method to be compiled. 

[0018] In one embodiment of the present invention, the initialized sets of 
the CID native code of methods of the same class are the same, so that a call from 
13 the CID native code of a method to the CID native code of another method of the 

same class does not need testing beforehand, if the invoking task has initialized all 
20 the classes of the initialized set of the second method. 

[0019] In one embodiment of the present invention, when the overall size 
of compiled code storage hits a threshold of consumed space, the runtime of the 
system can remove the CID native code from the method when the method also 
includes task re-entrant native code. All subsequent invocations of the method 
25 result in executing the task re-entrant native code, irrespective of classes already 
initialized by the invoking task. 
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[0020] In one embodiment of the present invention, only methods that 
include a number of class initialization barriers greater than a threshold are 
candidates for compilation into CID native code. Otherwise, these methods are 
compiled directly into task re-entrant native code. 

BRIEF DESCRIPTION OF THE FIGURES 
[0021] FIG. 1 illustrates computing device 100 in accordance with an 

embodiment of the present invention. 

[0022] FIG. 2 is a diagram illustrating the evolution of the representation 

of a shared class method, initially with platform-independent code, then 

augmented with CID native code and task re-entrant native code in accordance 

with an embodiment of the present invention. 

[0023] FIG. 3 illustrates the data structures for keeping track of the CID 

native code each task can use in accordance with an embodiment of the present 

invention. 

[0024] FIG. 4A is a flowchart illustrating how reversed initialized sets are 
created, and updated after creation of CID native code. 

[0025] FIG. 4B is a flowchart illustrating the action performed upon 
initialization of a class by a task to update information that keeps track of what 
CID native code can be used by that task in accordance with an embodiment of 
the present invention. 

[0026] FIG. 5 illustrates how method descriptors keep track of what native 
code entry point to use upon invocation of the corresponding method in 
accordance with an embodiment of the present invention. 

[0027] FIG. 6A is a flowchart illustrating the actions performed upon 
entering the CID native code of a method in order to decide whether the current 
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task can execute the CID native code or must interpret the platform-independent 
code of the method in accordance with an embodiment of the present invention. 

[0028] FIG. 6B is a flowchart illustrating the actions performed upon 
entering the CID native code of a method in order to decide whether the current 
task can execute the CID native code or must execute the task re-entrant native 
code of the method in accordance with an embodiment of the present invention. 

DETAILED DESCRIPTION 
[0029] The following description is presented to enable any person skilled 
in the art to make and use the invention, and is provided in the context of a 
particular application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
principles defined herein may be applied to other embodiments and applications 
without departing from the spirit and scope of the present invention. Thus, the 
present invention is not intended to be limited to the embodiments shown, but is 
to be accorded the widest scope consistent with the principles and features 
disclosed herein. 

[0030] The data structures and code described in this detailed description 
are typically stored on a computer readable storage medium, which may be any 
device or medium that can store code and/or data for use by a computer system. 
This includes, but is not limited to, magnetic and optical storage devices such as 
disk drives, magnetic tape, CDs (compact discs) and DVDs (digital versatile discs 
or digital video discs), and computer instruction signals embodied in a 
transmission medium (with or without a carrier wave upon which the signals are 
modulated). For example, the transmission medium may include a 
communications network, such as the Internet. 
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Computing Device 

[0031] FIG. 1 illustrates computing device 100 in accordance with an 
embodiment of the present invention. Computing device 100 can generally 
include any type of computer system, including, but not limited to, a computer 
system based on a microprocessor, a mainframe computer, a digital signal 
processor, a portable computing device, a personal organizer, a device controller, 
and a computational engine within an appliance. Computing device 100 includes 
multitasking virtual machine 102. 

[0032] Multitasking virtual machine 102 executes platform-independent 
code on behalf of multiple tasks such that each task is provided with the illusion 
that it is the only task being executed. Multitasking virtual machine 102 includes 
shared runtime storage 104, tasks 106, 108, and 1 10, platform-independent code 
interpreter 112, and dynamic compiler 1 14. Note that multitasking virtual 
machine 102 may include more or less tasks than the number shown. 

[0033] Shared runtime storage 104 contains the runtime data structures 
that can be shared among the tasks executed by multitasking virtual machine 102. 
Data structures that make the shared runtime representation of classes, such as, 
platform-independent code of class methods, dynamically compiled native code 
for class methods, and other class information (e.g., in the case of the Java 
programming language, the constant pool, the field, method, and interface arrays, 
etc) are stored in the shared runtime storage. Tasks 106, 108, and 1 10, maintain 
what is private to them in their respective private storage area. The initialization 
states of the classes used by a task and their static variables are example of task 
private data. Platform-independent code interpreter 112 interprets the platform- 
independent code for tasks 106, 108, and 110. Dynamic compiler 1 14 serves the 
requests issued by the runtime of virtual machine 102 to compile platform- 
independent code to native code of computing device 100. Converting the 
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platform-independent code to the native code of computing device 100 can 
increase the efficiency and throughput of computing device 100. Typically, 
dynamic compiler 114 converts the platform-independent code to the native code 
of computing device 100 after a threshold number of invocation of the platform- 
independent code. Since converting the platform-independent code to the native 
code of computing device 100 is a costly process in terms of processing time, the 
use of dynamic compiler 1 14 is delayed to ensure that the cost of conversion will 
be recovered in the faster execution time of the platform-independent code. 
Conversion to native code by dynamic compiler 1 14 is performed on a method- 
by-method basis (i.e., some class methods may be converted to native code by the 
dynamic compiler and some may not be converted). 

[0034] The native code generated by dynamic compiler 1 14 may be either 
task re-entrant or class initialization dependent. Both type of native code are 
stored in shared runtime storage. 

Code Progression 

[0035] FIG. 2 is a diagram illustrating the state transitions of a class 
method and the event that triggers each state transition, in accordance with an 
embodiment of the present invention. Multitasking virtual machine 102 sets 
method 200 to the initial state 224 upon loading of the class that defines method 
200. In initial state 224, all invocations of method 200, irrespective of the task 
carrying the invocation, result in interpreting the corresponding platform- 
independent code 202. Multitasking virtual machine 102 keeps track of the 
number of times method 202 has been interpreted. 

[0036] After the number of interpreted invocations of method 200 reaches 
the compilation threshold 208, dynamic compiler 1 14 converts platform- 
independent code 202 into CID native code 204 using an initialized set that 
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enumerates a set of classes that the compiler must assume initialized. Dynamic 
compiler 1 14 uses the initialized set to avoid generating task re-entrant class 
initialization barriers directed to any of the classes in the initialized set. After 
creation of CID native code 204, multitasking virtual machine 102 sets method 
200 into state 226. In state 226, invocations to method 200 result in interpretation 
of platform-independent code 202, if at least one of the classes in the initialized 
set of CID native code 204 is not initialized by the invoking task. Otherwise, the 
invocation of method 200 results in executing CID native code 204. Multitasking 
virtual machine 102 keeps track of the number of times each task interprets 
method 200 in state 226. 

[0037] Some tasks may not reach a state quickly enough where they can 
use CID native code 204. Multitasking virtual machine 102 detects this situation 
by examining the number of times the invoking task has interpreted method 200 
before each interpretation of method 200. If the number of times is greater than a 
blocking threshold 210, the dynamic compiler is requested to produce task re- 
entrant native code 208. This can be done either by converting platform- 
independent code 202, or by modifying a copy of CID native code 204. The latter 
is usually substantially faster, because it does not involve complex compiler 
analysis. Instead, instructions corresponding to task re-entrant class initialization 
barriers are inserted at locations previously recorded during the building of CID 
native code 204. Inserting such instructions may cause further updates because of 
code relocation (i.e., branch instruction may need updates as well). However, 
even with these complications, the cost of updating a copy of CID native code 
remains substantially cheaper than recompiling the corresponding platform- 
independent code. After creation of task re-entrant native code 206, multitasking 
virtual machine 102 sets method 200 into state 228. In state 228, invocations to 
method 200 result in execution of task re-entrant native code 206, if at least one of 
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the classes in the initialized set of CID native code 204 is not initialized by the 
invoking task. Otherwise, the invocation of method 200 results in execution of 
CID native code 204. 

[0038] In state 228, several versions of optimized native code may exist, 
5 each with unique requirements for use. Additional versions of optimized native 
code depend upon having available space within memory 104. If space is needed 
within memory 104 for additional classes, the additional versions of optimized 
native code and un-optimized native code 206 can be removed by code cache 
space reclamation 212, returning the system to state 226. The favored 
10 combination is one that maintains at most two versions of native code per method, 
I * one of which is task re-entrant. 

.JWK 

u 

: % Selection of code to execute 

10 [0039] FIG. 3 illustrates an example of data structures maintained by an 

}* 15 implementation of a multitasking virtual machine to keep track of the CID native 

|U code that each task can execute. All CID native codes are recorded in CID native 

^ code table 306. The index to the entry of table 306 that records a CID native code 

Q is used also to uniquely identify that CID native method. The header of a CID 



native code (e.g., native code 302 and 304) includes its unique identifier, and a 
20 pointer to the initialized set used to build the CID native code. The initialized set 
of a CID code can be implemented as an array of pointers to shared runtime 
representation of classes (e.g., initialized sets 310 and 312 for CID native code 
302 and 304, respectively). Classes that belong to at least one initialized set of a 
CID native code are associated with a reverse initialized set, typically 
25 implemented as a bitmap for compactness. The reverse initialized set of a class 
records which CID native codes include that class in their initialized set. Reverse 
initialized set bitmaps have as many bits as there are entries in CID native code 
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table 306. The unique identifier of a CID native code is used to index the bitmaps 
that implement reverse initialized sets. For instance, CID native code 304 is 
recorded in entry Y of table 306, and includes class 316 in its initialized set; the 
bit at position Y in the reverse initialized set 320 associated with class 316 is set 
5 to 1 to indicate that class 3 16 is in the initialized set of native code CID 304 
whose identifier is Y. Additionally, each task maintains an array of invocation 
counters that keeps track of the number of invocations of a method set with CID 
native code that did not result in executing that CID native code. As for the 
reverse initialized sets, these arrays of counters have as many entries as the CID 
10 native code table 306, and the unique identifier of a CID native code is used to 
I a index the arrays to obtain the invocation counter of a task for that native code. 

if ^ 

^ Each thread running on behalf of a particular task holds in its descriptor a pointer 

B "f" to the array of counters of their task. A negative counter value indicates that the 

|y s 

|y task has initialized all the classes of the initialized set of the corresponding CID 

13 

yr 1 5 native code, and therefore, invocation to the corresponding method can execute 

the CID native code. For instance, invocations of method 308 issued by threads 
322 and 324 result in executing CID native code 302, as indicated by entry X of 
counter array 328. However, invocations of method 308 issued by thread 326 
result in either interpreting the platform-independent code for method 308, or task 
20 re-entrant native code, depending on the state of method 308. 

[0040] FIG. 4A is a flowchart illustrating how reversed initialized sets are 
created, and updated after creation of CID native code. When the dynamic 
compiler builds a CID native code, it adds the CID native code to the reverse 
initialized set of each of the classes of its initialized set. For each such class, it 
25 first tests whether the shared runtime representation of the class is already 

associated with a reverse initialized set (step 404). If not, one is created (step 
406), and its pointer stored in the shared runtime representation of the class (step 
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408). The identifier of the CID native code is then added to the reverse initialized 
set (step 410). 

[0041] FIG. 4B is a flowchart illustrating the action performed upon 
initialization of a class by a task to update information that keeps track of what 
CID native code can be used by that task in accordance with an embodiment of 
the present invention.. Upon initialization of a class by a task, the multitasking 
virtual machine scans the reversed map of the class to find all the CID native code 
that includethat class in their initialized set (step 412). The initialized set of each 
such CID native code is then scanned (step 414) to determine if all the classes in 
the set, other than the one being initialized, are already initialized (step 416 and 
418). If this is the case, the counter corresponding to that CID native method for 
the initializing task is set to -1 to indicate that from now on, invocations of the 
corresponding method by the threads of that task can execute the CID native code 
(step 420). 

[0042] FIG. 5 illustrates how method descriptors encode the state of a 
method and indicate where to transfer control to execute the method upon its 
invocation. The native code to execute upon invocation is indicated by a native 
code entry point field in the descriptor of the method. The dynamic compiler uses 
the code entry point field to generate call instructions that implement invocation 
to that method; the interpreter uses the code entry point field to determine when 
native code is available, and which entry point to native code it should use; and 
the runtime of the dynamic compiler uses the code entry point field to update, 
upon method invocation, call sites that became obsolete because of a transition of 
the state of the invoked method. When method 5 10 has no native code, the native 
code entry point field is set to null, indicating that control must be transferred to 
the interpreter, which will execute the platform-independent code of method 510. 
When a method has only one native code (whether it is task re-entrant as when 
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method 510 is in state 508, or class initialization dependent, as when it is in state 
504), invocations of the method always results in executing the entry point of the 
native code. When a method has both task re-entrant and CID native code (as 
when method 510 is in state 506), its invocation always results in executing an 
entry point of its CID native code. In all cases, CID native code is responsible for 
determining whether it can be used by the task that performs the invocation. CID 
native code 512 for method 502 contains two guarded entry points: guarded entry 
point 514 is used when the method does not have task re-entrant native code in 
addition to its CID native code 510; guarded entry point 516 is used when the 
method also has task re-entrant native code 510. In addition to the guarded entry 
point, an unguarded entry point is also available when testing whether the 
invoking thread can use the CID native code is unnecessary. This third entry point 
is entered from the two guarded entry points. It may also be entered directly when 
the invoker method has the same, or a superset, of the initialized set of the 
invoked CID code. For instance, a dynamic compiler may choose to compile all 
methods of a given class with the same initialized set. In this case, all invocations 
from the CID native code of a method of a class to the CID native code of another 
method of the same class can use the unguarded entry. Similarly, recursive 
invocation can use the unguarded entry directly. 

[0043] FIG. 6A is a flowchart illustrating the actions performed upon 
entering the CID native code of a method without task re-entrant native code. In 
this case, the guarded entry point must determine whether the task on behalf of 
which the current thread runs has initialized all the classes of the initialized set of 
the CID native method. This is accomplished by examining the invocation 
counter that corresponds to the CID native code. The array of invocation counters 
for CID native code for the current task is first obtained from the descriptor of the 
current thread (step 602). The appropriate counter is then obtained from this array 
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by indexing the array with the unique identifier of the CID native code (step 604). 
The value of the counter is then tested (step 606). A negative value means that 
the task has initialized all the classes listed in the initialized set of the CID native 
code, and the thread can execute the CID native code (step 618). Otherwise, the 
counter is incremented (step 608), and its value compared with that of a blocking 
threshold (step 610). If the counter is greater than the threshold, the dynamic 
compiler is invoked to build task re-entrant native code for the method (step 612), 
and execution is transferred to this task re-entrant native code (step 614). 
Otherwise, control is transferred to the interpreter, which executes the method's 
platform-independent code (step 616). Note that once the task re-entrant code is 
built, the native code entry point encoded in the method descriptor is modified to 
refer to the second guarded entry point of the CID native code, whose actions are 
described below in conjunction with FIG. 6B. 

[0044] It is possible that all the classes in the initialized set of a CID 
native code have been initialized by a task, but that that task's counter for this 
CID native code was not set to -1 to indicate so. This situation can happen when 
the CID native code is created by another task, after the first task has initialized all 
the classes of the initialized set of the CID native code. The counter is not 
updated in this case because determining all the tasks that have initialized all the 
classes of the initialized set of the CID code can be expensive. Instead, the 
runtime waits for a task in this situation to reach the blocking threshold to test 
again the initialization status of the classes in the initialized set of the CID code 
for the task. If the task has initialized all the classes, its counter for the CID code 
is set to -1 and the method invocation resumes by executing the CID native code. 

[0045] Below is a sequence of assembler code for SPARC™ V9 
processors that corresponds to the action described in the flowchart of FIG. 6 A. 
SPARC is a trademark or registered trademark of SPARC International, Inc. in the 
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United States and other countries. Note that the tasks that can execute the CID 
native code pay only the overhead of the first four instructions of the entry point 
(to test if the task has initialized all the classes in the initialized set of the CID 
native code). This corresponds to the overhead of a single task re-entrant class 
initialization barrier. The remaining instructions are executed only when the 
method must be interpreted, and processing overhead is not as crucial for 
performance in this case. Note also the use of an atomic compare-and-swap (cas) 
instruction to update the invocation counter, in order to avoid race conditions with 
class initialization code that may set the counter to -1 upon initialization of the 
last non-initialized class of the initialized set of the CID native code. Immediately 
after the cas instruction, the exchanged value of the counter is tested again in case 
a concurrent thread did change the counter to indicate that the task can now use 
the CID native code. 

1. Id [r_thread + of f set_to_counter_table] , ratable 

2. mov IDENTIFIER, r^index 

3. Id [ratable + r_index] , r_counter 

4. brlz, r_counter, unguarded__cid_entry_point 

5 . nop 

6. add r_counter, 1, r__new_counter 

7. sethi hi { interpreter_ent ryjpoint ) , r_iep 

8. cas [ratable + r_index] , recount er, new_counter 

9. brlz new_counter, unguarded_cid_entry_point 

1 0 . nop 

11. set BLOCKING_JrHRESHOLDl r rb 

12. cmp rb, new_counter 

13. be call_dynamic_compiler 

14 . nop 

/* jump to the interpreter entry point*/ 

15. jmp r_iep + low ( interpreter_entry__point ) 

16. nop 
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[0046] FIG. 6B is a flowchart illustrating the actions performed upon 
entering the CID native code of a method that also has task re-entrant native code. 
As before, the entry point must determine whether the task on behalf of which the 
current thread runs has initialized all the classes of the initialized set of the CID 
5 method. This is determined as in the other guarded entry point. The array of 
invocation counters for CID native code for the current task is first obtained from 
the descriptor of the current thread (step 620). From this array, the counter for the 
CID native code is then obtained by indexing the array with the unique identifier 
of the CID native code (step 622). The value of the counter is then tested (step 
10 624). A negative value means that the task has initialized all the classes listed in 
the initialized set of the CID native code, and the thread can execute the CID 
native code (step 628). Otherwise, if the counter has a value of 0, control is 
immediately transferred to the available task re-entrant native code (step 626), 
which is then executed (step 638). Otherwise, the counter is incremented (step 



o 

j 2 15 630), and its value tested against a blocking threshold of the number of 



invocations of the method that could not result in executing its CID native code 
I* (step 632). If the counter's value is below the threshold, control is transferred to 

l» the available task re-entrant native code (step 638). Otherwise, a request to 

;~ asynchronously recompile the CID native code is issued to the dynamic compiler 

iU 

20 (step 634), the counter is set to 0 so that subsequent invocations directly result in 
executing the task re-entrant code of the method (step 636), and the task re-entrant 
code is executed (step 638). Note that in case the blocking threshold is hit, the 
invoker does not wait for the CID native code to be recompiled. The decision to 
recompile the method is ultimately taken by the dynamic compiler based on the 

25 feedback obtained from execution. For instance, it may be that only one task over 
a long period of time has hit the blocking threshold. In this case, the dynamic 
compiler may ignore the request submitted by that task since it does not benefit 
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the majority of tasks that used that method. For this reason, the request is 
submitted asynchronously, and a shorter path to executing the task re-entrant code 
is created by setting the invocation counter of the task to 0. 

[0047] Below is a sequence of assembler code for SPARC™ V9 
5 processors that corresponds to the action described in the flowchart of FIG. 6B. 
As before, the tasks that can execute the CID native code pay only the overhead of 
the first four instructions of the guarded entry point. Tasks that have reached the 
blocking threshold pay an additional overhead of three instructions (instructions 6, 
7, and 16) upon every invocation of the method. Note that the blocking threshold 
10 in this case can be much smaller than for the other entry points since it may pay 
off much earlier to switch to task re-entrant native code (since the switching does 
not involve any heavyweight compilation cost, as opposed to the case when the 
method does not have already task re-entrant native code). 



p 15 1. Id [r_thread + of f set_to_counter_table] , ratable 

2. mov IDENTIFIER, r^index 

3. Id [ratable + r_index] , r_counter 

4. brlz, r_counter, unguarded_cid__entry_point 

5 . nop 

M 20 6. brz, r_counter, execute_task_reentrant__code 

jj?! 7. sethi hi ( task_reentrant_entry__point ) , r_trep 

8. add recount er, 1, r_new_counter 

9. cas [ratable + r_index] , r_counter, new_counter 

10. brlz new_counter, unguarded_cid_entry__point 
25 11. nop 

12. set BLOCKING_THRESHOLD2, rb 

13. cmp rb, new_counter 

14. be call_dynaraic_compiler 

15. nop 

30 execute_task_reentrant_code : 

16. jmp r_trep, low (task_reentrant_entry__point ) 

17. nop 
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Choice of an initialized set 

[0048] When building CID native code, the dynamic compiler should 
choose an initialized set such that the code produced can be immediately used, at 
least by the task that requested the dynamic compilation. In other words, the 
5 chosen initialized set should be a subset of the set of classes already initialized by 
the task on behalf of which the compilation takes place. The initialized set must 
also be a subset of the comprehensive initialized set of the method being 
compiled, which is the set of all the classes whose initialization can be triggered 
by that method. Thus, a good compromise for an initialized set is the intersection 
10 between the set of classes already initialized by the task requesting the dynamic 
!* compilation (i.e., the requester), and the comprehensive initialized set of the 

]Z method. This intersection can be determined by first computing the 

!f S comprehensive initialized set of the method, then iterating over it to find which 

m ones have been initialized by the requester, and adding them to the initialized set. 

□ 

I 1 5 [0049] The comprehensive set of methods can be simply obtained by 

* m iterating over all the instructions of the platform-independent code of the method, 

I & and for each method requiring a class initialization barrier, add the targeted class 

jji? to the comprehensive initialized set. Computation of the comprehensive 

initialized set can be made opportunistically, at times when the platform- 
Ill 

20 independent code is already being iterated over. For instance, in the case of the 
Java programming language, the JVM performs an abstract interpretation of the 
platform-independent code to verify it. Verification can be modified to also 
compute the comprehensive initialized set of all the methods of a classes. 

[0050] Dynamic compilers commonly start with building an intermediate 

25 representation of the code they compile in order to ease subsequent code analysis 
and optimization phases. Building the intermediate representation requires 
iterating over all the instructions of the platform-independent code of the method, 
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and is a better place to build the comprehensive initialized set of a method. Thus, 
the comprehensive initialized set can be computed while building the intermediate 
representation, and the initialized set can be determined once the intermediate 
representation is built, just before the optimization and analysis phases of the 
compilation where the initialized set can be used. 

[0051] Note that while building native code, the dynamic compiler may 
identify classes for which the compiled method will never trigger an initialization 
class. Such classes are removed from both the comprehensive initialized set and 
the initialized set actually used. 

[0052] The foregoing descriptions of embodiments of the present 
invention have been presented for purposes of illustration and description only. 
They are not intended to be exhaustive or to limit the present invention to the 
forms disclosed. Accordingly, many modifications and variations will be apparent 
to practitioners skilled in the art. Additionally, the above disclosure is not 
intended to limit the present invention. The scope of the present invention is 
defined by the appended claims. 
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